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Incorporating Partially Identified Sample Segments 
Into Acreage Estimation Procedures: Estimates Using 

Only Observations From The Current Year 

by 

Robert L. Slelken Jr. 

Institute of Statistics 
Texas A&H University 

December, 1981 


ABSTRACT 


For the purposes o£ area estiioation a segaKBnt la said to be completely 
ldentl£led If the acreage o£ each crop o£ interest can be directly estiinated* 

A segment is said to be only partially identified if some individual crops vithin 
a family of crops cannot be differentiated and their acreages individually esti- 
mated. For example, if it estimated chat 30JSS of a segment is spring small grains 
but no further differentiation into say spring wheat and other spring small grains 
is made, then the segment is only partially identified. Several methods of 
estimating individual crop acreages using a mixture of completely identified and 
partially identified segments from a single growing year are derived and discussed. 

A small Monte Carlo study of eight estimators is presented. The relative empirical 
behavior of these estimators is discussed as are the effects of segment sample 
size and amount of partial identification. 

The principle recommendations are (1) to not exclude, but rather incorporate 
partially identifed sample segments into the estimation procedure, (2) try to 
avoid having a very large percentage (say 80%) of only partially identified 
segments in the sample , and (3) use the maximum likelihood estimator although 
the weighted least squares estimator and least squares ratio estimator both perform 
almost as well. 

In this report the family of crops which cannot always be completely identified 
is assumed to contain a particular crop of Interest and the rest of the family is 
essentially considered collectively as simply a second crop. For example, the 
family of spring small grains might be subdivided into two crops, say barley and 
"other spring small grains'*. 
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!■ Introduction 


A samp la of oagmaity,te within a stratum la solactod. Bach segmant Is obsarvad 
ropaatedly through the growing saasons. The objective Is to estimate for each 
crop of interest the proportion of the stratum's acreage corresponding to that 
crop's harvested acreage. 

In this report, we shall assume that there are only two crops of Interast. 
Furthermore, we shall assume that only data from the current growing year are to 
be used In estimating the crop at harvest proportions. The ''ases corresponding 
to more than two crops of interest and data available from more than one growing 
year will be considered in subsequent reports. 

The sample segments are all assumed to be of the same size. No assumption Is 
made about the stratum size or the segment size. The sampled segments are assumed 
to represent a random sample (without replacement) from the segments In the stratum. 

Each sample segment is assumed to have been observed at least once during the 
growing year and possibly several times. The two crops of Interest will be desig- 
nated as crop A and crop B. When a sample segment Is observed, the observation 
can have the form 

^^A* ^ other^ 

where 

" the estimated proportion of the segment which will be harvested 
In crop A, 

Pq « the estimated proportion of the segment which will be harvested In 
crop B, and 

** other " ^ ” **A ” ^B " estimated proportion of the segment which 

will not be harvested In either A or B. 


- 2 - 

s 

Al6tttiiativ«ly» ttstlmAesB noy not bo mado on A and B aapai ataly but only on A and 
B collactivaly* ao tliat tha obaarvation can Hava tha form 

^^♦B» ^othar^ 

whara 

• tha eatlmatad proportion of tha aagmant that will bo harvaatad 
in eithar A or D» and 

^other * ^ eatimated proportion of tha ee^ont that will 

not ba harvaatad in A or B. 

Thaae aatimataa are assumed to reflect any previous observations made on this sample 
segment during the current growing year. If a sample segment's current observation 
is of the form 

^*A+B* *other* 

then the s^mnple segment will be called partially identified. If its observation is 
of the form 

^•a* *B» *othet^* 

then It will be called completely identified. 

The proportion of the stratum harvested in crop A will be denoted with Pg 
similarly defined. The objective is to estimate P^ and Pg using the observations 
on the sample segments. This estimation may have to be made at more than one time 
during the growing year. Of course, if there are no completely identified sampled 
segments, only the sum P^ -f Pg can be estimated on the basis of the sample segments. 
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2. Eaeimation Proc»dur»» 

Thsrs ar« undoubeadly iaverol dacision iBskara wichln a atrakun who collactivaly 


detarmina tha proper eional aeraagaa (P^» ^orhar^ afcrattBB. Tha «cact 

daeialon making proeaaa la unknown. Hanca* cartaln aoaumptlone will ba mada about 
tha outcome of this daeialon making proeaaa t and thaaa aaaumptlona will auggaat a 
few reaeonabla eatlmatlon proeadurea for eonaldaratlon. Tha aaaumptlona ara at baat 
approxlmatlona and are no doubt taehnlcally not eatlaflad. The only role of thaaa 
aaaumptlona la to auggaat eatlmatlon proeeduras. Tha worth of thaaa proeadurea 
will not be determined relative to tha underlying aBBumptlona which auggaatad tham» 
but rather determined by their behavior on real aampla data. 

The number of aeres harveated In a aegment, aay (X^» H -> * Xq)» will 

be aaaumed to follow a multinomial dlatrlbutlon with ' 

P(X^ - x^, Xg ■ Xg) 

- N Pa*^ P«*® (1 - P. - Pg) . (1) 

*A= ’•b' - *A - V' 


This assumption would be technically correct if every decision maker acted Independently 
and for each unit of acreage chose crop A» crop B, or other with probabilities 

^other ’'^®®P®*^*^^vely. The CAMS estimated segment proportions (p^, Pg, 

are assumed to correspond to 

_*A 5a_. ” ' . 

N M N 

The following notation will be used throughout this paper. The total number of 
segments sampled will be denoted by 

n •* segment s^ple slse (2) 

The sample segments which have been completely identified will be denoted by 

^^Ai* PbI* 

Similarly, the sample segments which have only been partially identified will be 




ORIQINM.PA»n 
OF POOR QUALfTV 


d«nottd by 


I (p^ + ^ ( 

Hones 

n ■ I + J. ( 

Since no ooparato eatimate of and Pq could be nada aolaly on bba baaia of the 
aamplad aegawnca imlaaa aoma aaoqpled aagDonea had bean complafaly idaneifiadi ic 


will be aaaimed that 


1 > 0 


Under the aaaumed multinomial distribution (1) the joint likelihood for the 


CAMS estimates is 


*Ai' 


P*A-t (1 - P^ - Pg) **“*A1”*B1J 


•[a 


«A + V ■"■«A + VjJ' 


(Pa + Pb>^V V C 1 -(p^ + Pb>i”"^a ■** *b^^ 


or, equivalently, 


{ n n; 

Wp..)! (NPa.)! 


i^l)! (Sp^t)! m - Np^ - NPbi)! 


N-NP^l-NPjt 


P^PaI ,;Pb 1 ^,.,^.,_^,-»PAi-«PBtj . 

fn 

(N(Pa + Pb>j 1! «i - (Pa + PB>jl)’ 


“■a + ■ Pa - Pb>“‘' 


(8) 




2 


: •' rfe-' * '1C T r! '.2/ . " 3 - . 




ORIGINAL P/^ W 
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wieh rospflct eo P^aiul P^i then (f^t X - mtiM t)t thiB liMt f^t»rM 

Mtloiu of (P^, Pg, 1 - P^ - Pg), Th« fttuieing isasfe i^uim oofiaMitot m 

■ { w ■" L * li "■ ■" ’ 

and 

Pj . f(31 + MU^PbI ■ 


nm 4* 4j). 


1 ‘ *• 


2.3 .Hflightad UftBt Saiiaraa gitiaaf 

Sines one may wish to glvs mors wsight to mors prsclss (loss vsrlsbls) 
sstlmatss than to Isos prseiss (mov^o vsr labia) astlmatss, a wsightsd Isast s<iuarss 
sstimata of (P^, Pg, 1 • P^ « Pg) may be preferable, fhe weighted least squares 
eotlmate minimizes 
I 


«A1 <0aI - V* + - V* * 

”a+B, 1 f " PaI * PbI^ ‘ “ Pa ” 


JiVb. 3 ‘ <^A ♦ "B>3 - * VI* * {ll - ♦ '■b>3’ - 

U-(Pa + Pb>J H 

with respect to and Pg. The corresponding weighted least squares estimates 
would be 


(15) 


B * tfi^A+B,! * ^ 


A1 + V> * ^ jfi^B.J ^^A + »B>j - 
[j,«Ai * J>B.i ^ ^ j>B.j] 


P 


-7 


and 


pfm » 

OP POOR QIMUTV 


f. 


ft j>0»l ♦ ♦ Pm> + 2 Ji"a«.J<Pa ♦ 


1 z 

[ £ H*. 4- 5 W 


4- 2 £ U, 


i-l j;fA4.B,J 


1 - 


^ j^^AlPAl * Jj^A-^B.I^PaI * Pb1> (|»^ 4* J • 


I J 

i E Wa4.«44 + 2 E 


1-1 d-1 


"A4-BJ1 
I I 


Vb.J>] / [l j> - j>B,i * * J>B.l 




( 16 ) 


If tha walghta, W, are all takan eo ba ona, than tha walghtad laaat aquataa 
eaclmataa (16) ara aqual to tha laaat aqtiaraa aatlinataa (14). A raaaonabla 
altarnotiva to equal walghte* aay W'a - 1, la to weight each aquarad dlffaranea 
Invaraaly In proportion to tha approximate variance of tha CMS aattoata It eon- 
taina. Specifically • 

”a1 " " V 

Mgi - l/Pgd - Pg) , 1-1, 2,..., I. 

Vb.I ■ - K -®B>l<^A + Vl • 1-1.2.— I. 

«A+B,j’l/‘^*Vj<l-^A- Vj . 3-1.2.— J. (17) 

Tkeae weights put more weight on the differences where tha CMS estimates are 
more precise (less variable). 


The weights (17) use estimated variances corresponding to the multinomial 
distribution assumption. The actual calculation of the weighted least squares 
estimates would be an iterative procedure. Initially, the weights would be set 
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OF POOJI 


•quAl to ono, thtn and P|| dotomlnod item (16). 7t»ii ioitUi •itiaitoi 

•fid P- would bo oubotitutod Into (17) e© dotornifio tlio mig^ta for cho fooood 
B 

itorotion. Tho ovoliiotlon of (16) now yioldo tho ooeond lit of iotliiiitoo 
and P^, Aftor a aodoroto numbor of itor«tiono« ooy 4» Clio Odd tbould 
ehongo vary llttla and will ba takan aa tba waightad laaat aquaroa aatimataa. 

2.4 Tha Combination of a Uaat gflutigia Bafeio MtiMta_iiid_a^to 

Proportion Eatimata 

An altamativa approach ia to uaa only tha eonplataly idantiflad aasipla aa^nta 
to aatimata 

“a ■ V<^A + V 

whara ia tha rolativa proportion of crop A among tha aeraaga proportion for 
crop A or B. Similarly! 

■ V‘^A ♦ V ■ ^ - “a- 

* 

Thm>, m-ying aatlmatad and banco E^t aatimata tha ovarnll atrattmi proportiona 
P^ anO Pg by 


’ (19) 




. t 


A4-D 


and 


Pb " % 


f 


A+B 


whara la an aatimata for the combined acreage proportion 

AtII 

^A+B ■ ^A *'■ • 

Tha combined acreage proportion could be estimated by the maximum 
llloalihood estimate 


'A+B 




+ j) , 


Other estimates of P^^q could be used. 

The relative proportion R^ can be estimated by minimizing 

Mpai • •*a<Pai + + .MObi - ■ V <^1 + "Bi” 


1-1 


1-1 


( 20 ) 


( 21 ) 


( 22 ) 


< 9 ) 


with resp«ct to R^. The resulting least squares esttoator of R^ is 






iliililllliiii 


( 24 > 
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Empirical BthBVlor q£ 8o«i Bitimai;ton_Egoeadugai» 

Soinaona once said, "Tlia proof of rha pudding la In eha Caatlng.*' Similarly, 
hara cha proof of an aatlRuielon proeodura*a valua liaa In ita aeCual parformauea on 
real data. To loam about the empirical behavior of the four altamativa aatlmation 
procedurea 

(1) maximum likelihood aatimatora, (9) - (11), 

(2) laaat squaraa oatlmatora, (14), 

(3) weighted leaot equaraa aatimatora, (16), and 

(4) thb combination of a leant aquarea ratio eatimator, (24), and the maximum 
likelihood proportion eatimator, (22), 

a Moute Carlo study waa performed. The atructure of the study is discussed in eub- 

eectlona 3.2 and 3.3* The computer output from the study la reproduced in Appendices 
A and D. The conclusions reached from the study are given in subaecti/m 3.4. * 


3.1 Potential for Bias 

When some of the sampled segments are only partially identified, it la poaelble 

for the estimators of (P^, P^, l-P^-P^) to be biased. 

To demonstrate this potential for bias, the following simple example is given. 

The stratum Is composed of two segments. Segment one has (p., p„, l-p.-Pn) • 

A o Ad 

(.5, .5, 0), and segment two has (p^, p^^, l-p^-p^) -(.2, .6, .2). Hence, the stratum 
has (P^, Pg, " (*35, .55, .1). Suppose that the maximum likelihood estimator 

(9) - (11) is to be used. One segment is to be partially identified and one completely 
identified. If segment one Is the one completely Identified, then 




y.5 +.5 4- .8> 

.5'^ 1 1 ' 


.45 


If segment two Is the one completely Identified, then 


A 



.2 


.2 + .6 


)(- 


2 -f .6 -f 1 , 

1 + 1 ^ 


.225 


Thus I th« cxpscead valut of is 

ECp^I ■ .5(.45) + .S(.225) ■ .3325 . 

Hsnca P^ is biaesd since .3375 ^ .35. 

3_._2 _Mon£_e_ Carlo Study. 

The slmulacion of ehe behavior of the ostimaeion procedures was based around 
two real sets of CAMS estimates. The 1977 North Dakota CAMS esticuites of 

crop A " spring wheat 

and 


crop B ■ spring small grains other than spring wheat 
formed one set. The second set was the 1978 North Dakota CAMS estimates of 

crop A ■ barley 


and 


crop B ■ spring small grains other than barley. 

There were 83 segments comprising the first ^et and 76 segments in the second set. 

All simulations were done on each data set separately. 

A ^ A A 

To obtain a single observation on (P^, P^, 1-P^-Pg) for each estimator the 
following procedure was performed: 

1) Simulate a stratum by sampling without replacement segments from the 
data set. 

11) Select a sample of segments from the simulated stratum by sampling without 
replacement segments from the simulated stratum, 
ill) Calculate all estimators. 

This procedure was repeated 1000 times for each data set, stratum size, segment sample 
size, and fraction of the segmen' sample that would be only partially identified* 

Stratums of size 40 segments and 20 segments were considered. The segment sample 
sizes were 5, 10, and 15 when the stratum size was 40 segments and were 5 and 10 when 


the stratum size was 20 sagmiants. For each stretina size and sepisnt swsple sizei 
20% • 40% » 60%, and 80% of the segmants were partially identified* 

3.3 Evaluation CrlteiAiL- 

Thare are several possible ways to measure the sample behavior of the estiwaters 

A' AAA 

For each estimator and each of P^, P^, and l-P^-P^ the following measurea were calcu-^ 
lated for each data set: 

(1) average absolute error " the average over 1000 simluations of 

" ^Stratuml ^stratum ®'»® actual crop proportion 


in the particular simulated stratum* 

A 

(11) average squared error ■ the average oyer 1000 simulations of (P - P 


) 


stratwn 

A 

(111) bias of average estimate ■ the difference between the average (P) In 1000 
simulations and P^^. where Is the actual crop proportion in the 
entire set of segments* and 
(Iv) sample variance of the estimator. 

Some Information la* of course* lost when some segments are only partially 
Identified. To assess this loss* the maximum likelihood estimators were also calcu- 
lated using the complete identification for all sampled segments. Since these esti- 
mators have complete Information for the entire sample of n segments Instead of 
complete Information on only some of the n segments and partial Information on the 
remainder* these latter estimators should perform better. 

To show that the Inclusion of the partially Identified segments Into the 
estimation procedure is better than simply Ignoring them, the maximum likelihood 
estimators* least squares estimators* and weighted least squares estimators were 
also calculated using only the subset of the n sample segments corresponding to the 
completely Identified segments. 

The observed behavior of each of these eight estimators Is detailed In the 
computer outputs reproduced in Appendix A for the 1977 spring wheat data set and 
Appendix B for the 1978 barley data set. 


3.4 Indlcatlotti of che Motif Carle Study 


Smaroarlae of Cha obaarvad avaraga abaoiuea arrora for eha bafitar aatinuieosi 
aro glvan In Tablaa 1 and 2 for tho 1977 and 1978 data aata raapactivaly. Tha 

t 

avaraga ovar tho ant Ira data aata of (p^» Pg* ^ ~ Pg) 

1977» (.235, .073, .892) 

1978: (.053, .251, .698). 


Hance It is not aurprlaing to obaarve that tba bahavlora of tho aotimatora for A in 
1977 ara vary similar to those for B in 1978 and, corraspondingly, the estimators 
for B in 1977 and for A in 1978 have nearly identical behavior. Also, the behaviors 
of the estimators of 1 - in 1977 are nearly the same as the corresponding 

behaviors in 1978. 

The unweighted least squares estimators given in (14) are substantially in- 
ferior to the other estimators, and hence their behavior is not summarized in Tables 
1 and 2. There are much smaller differences among the remaining estimators. 

When both the partially and completely identified sample segment estimates 
are Incorporated Into the maximum likelihood estimator (MLE) given in (9)- (11), 
the weighted least squares estimator (WLS) given in (16) , and the combination of 
the least squares ratio estimator (LSR) given in (24) and the maximum likelihood 
proportional acreage estimator given in (22), all three estimators have similar 
behavior. Overall, the MLE is slightly better then the VLS, and the WLS is slightly 
better than the LSR. The differences are only slight since the average absolute 
error for LSR Is never more than 110% of that for MLE. The average squared errors 
tell the same story. There are no significant differences among the biases or sample 
. variances for the MLE, WLS, and LSR. 

The effects of the sample size and amount of partial identification are 
essentially the same for MLE, WLS, and LSR. When the underlying data set propor- 
tional acreage is nearly 25% as it is for in 1977 and in 1978, then the average 
absolute error decreases as the segment sample size, n, increases roughly as 


lA 


tollOWB 

.04 for n ■ 5, 

.03 for n and 
.02 for n -X 

Hhon tho undorlylng daea eat proportional acraaga la naarly 5S aa It la for i 

I ■ : 

in 1077 and in 1978 • than tha avaraga abaoluta decraasaa vith incraaaing aagoant | 

j . 

aomplo aiza roughly aa follows | 

.02 for n ■ 5» 

.013 for n ■ 10, and 
.01 for n «■ 15. 

Thara ia roughly a quadratic Inereaaa in average absolute error as tho amount of 

partial Identification increases from 20% to 40% to 60% to 80%. The average absolute I 

error increases around twice as much when the amount of partial identification 

increases from 60% to 80% as it does when the amount of partial identification 

increases from 40% to 60%. 

All of the estimators seem to slightly (%%) overestimate the combined propor- 
tional acreage for A and B when the segment sample size, n, is 5. This bias drops | 

to ^ when n ■ 10 and essentially disappears at n * 15. 

The VJLS does not seem to be as good as either hELE or LSR when there is only | 

I 

one completely Identified sample segment. 

Tables 3 and 4 indicate the slight increase in average absolute error caused 
by only partially identifying a segment's acreages instead of completely identifying 
. them. Only the behavior of MLE is summarized since HLS and LSR should behave 
similarly. Naturally the increase in the average absolute error increases as the 
amount of partial identification increases. For less than 80% partial identification 
there is always an increase of less than .01. For 80% the largest increase is less 
than .017. At 30% partial identification the average absolute error can double. 

Of course, if there is 100% partial identification, then there are no individual 


crop estimates. 


i '■ 

' ■ ■ ■_/ 
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4 

Tabitts 5 and 6 indicaCa eha avaraga id)aoluba arcora both ^th and nitbout 
tha partially idantlfiad aampla aagoante balug includad in tha aatinatoca fOJS and 
HLS. Thaae tablaa ara praaantad to axaii^lity tha valua of at laaat pattialiy 
Idantlfylng a aampla aagmont i^ataad of alii^ly diaragarding it if it cui't ba 
complataly idantifiad. Tha pareantaga incraaaa in tha avaraga abaoluta arrora 
when the underlying proportional acreage ia nearly 25% ia uaually lit laaat 


5% - 10% 

for 

20% 

partial identification* 

10% - 25% 

for 

40% 

partial identification! 

25% - 50% 

for 

60% 

partial identification! 

50% - 100% 

for 

80% 

partial identification* 


The percentage increaees are eomewtiat emaller whan tha underlying proportional 
acreage is around 5%» 
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4. RaconfflKindiit loni 

On ehtt basis of fehs lintod Monta Carlo study eha following racomndaelona 
ava madas 

1) As long as thara ara sods complataly Idantifiad aaa^la sapMnta, 

It is raasonabla to astlmata tha individual crop proportions in tha 
stratum. 

2) Xt is prudant to avoid having a larga parcantaga (say 80%) of only 
partially idantifiad sampla sagments. 

3) It is much better to incorporate the partially Idantifiad ample sagments 
into the estimators than it is to disregard tha partially identified 
ample segments. 

4) The maximum likelihood estimators seem to be the best estimators » but 
they are not greatly superior to weighted least squares estimators or 
the use of a least squares ratio estimator. 







Tabu 1, A Suffliuiey o£ tha 4 

Walghtad Uaae 8q2raa^E«iltl!f..®**?*^ Htasiaw MkmUhnnA « 

8*tio Saeiaator aSR)^lth ^ 

■* ■ * ■ ' ' 

frnJf"* S«gMne« 

^pl» SMpia <taiy 

Slaa 8180 Partially 

Xdontlfled \ 0 


Batltnatoifi 


^ 


HLB 

•0«4 .0424 .0431 .0156 .0157 oZl .Z ^ 

•0W3 .0435 .0442 .0177 0176 n«f ’®*“ •"*“ 

.0442 .0452 .0450 0210 '®*“ 

•om .0621 Z 'Z o! ! •«« 
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•0«3 . 0347 . 0333 .05“ 'If •»“0 

.0536 .0515 .0514 .OS 15 .oji« 


•0116 „0U6 .0125 1 .0294 
.0136 . 0136 .0146 1 .0282 
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.0092 . 0091 .0100 1 
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•0129 .0125 .01391 
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.0205 
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4 
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4 
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h Sianacy of tba Avaraga Abioluea Berota for tba IMtta l4kaMhood Batiintor 
(HLB>» and eha Haighead laaae Sqttaraa Batiaaeor (ini9) » boeti Wieh and Wifetione 
tha Partially Xdantifiad Saapla Bagnanta on tha 1977 Morth Dakota gprioi Ubaat 
Data Sat 


SagDont Sagnanta 
Sample Oply 

Slaa Partially $ ^ 

Xdantifiad ^ ^S 


Eatimatort 

With Without 

WLS 

With Without 

lOj; 

With Withnut 

wu 

With Without 

5 1 

.0424 .0476 

.0424 .0476 

•0156 .0166 

.0157 .0166 

2 

.0433 .0566 

.0435 .0566 

.0177 .0201 

.0176 .0201 

3 

.0442 .0693 

.0452 .0693 

.0210 .0254 

.0208 .0254 

4 

.0491 .1030 

.0621 .1030 

.0281 .0365 

.0410 .0365 

10 2 

.0276 .0324 

.0276 .0324 

.0106 .0115 

.0106 .0115 

4 

.0288 .0381 

.0288 .0288 

.0120 .0135 

.0122 .0135 


.0298 .0491 

.0299 .0491 

.0144 .0176 

.0145 .0176 

8 

.0328 .0725 

.0334 .0725 

.0194 .0248 

..0199 .0248 

15 3 

.0206 .0245 

.0206 .0245 

.0079 .0086 

.0079 .0086 

6 

.0207 .0289 

.0207 .0289 

.0093 .0107 

.0095 .0107 

9 

.0216 .0386 

.0217 .0386 

.0108 .0140 

.0110 .0140 

12 

.0249 .0578 

.0251 .0578 

.0160 .0204 

.0160 .0204 


5 

1 

.0387 .0443 

.0387 .0443 

.0149 .0163 

.0150 .0163 


2 

.0385 .0526 

.0387 .0526 

.0172 .0194 

.0173 .0194 


3 

.0417 .0682 
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.0199 .0242 

.0199 .0242 


4 

.0464 .1005 

.0610 .1005 

.0287 .0359 

.0439 .0359 

10 

2 

.0231 .0278 

.0231 .0278 

.0085 .0096 

.0086 .0096 


4 

.0232 .0338 

.0233 .0338 

.0105 .0121 

.0108 .0121 


6 

.0241 .0453 

.0242 .0453 

.0131 .0163 

.0132 .0163 


8 

.0286 .0706 

.0394 .0706 

.0185 .0241 

.0192 .0241 
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Tabic 6. A SmMfy of tha Averogo AbaoiuCo Bfrofi for fha Maxiaiaa iifeiiife7«^ Itetiiiatoir 
(MLB>» and fho Waigheod Laaaf flguaraa Baeinator <ta<8)» both With aod diebout 
eha Parfiaiiy ZdonClfiad S«apia Balnea on 6ha 1978 Hortb Dakota Baelay Data 
Sat 


Stratua Sapiant Sagaanta 

Saapla Saapla Only 

Siaa Siaa Partially a ^ 

Idantifiad Pa 


Batiouitort 

MLB 

With Without 

WLS 

With Without 

MLB 

With Withoun:, 

WLS 

With Without 

5 1 

.0183 .0189 

.0182 .0189 

.0426 .0475 

.0426 .0475 

2 

.0205 .0220 

.0203 .0220 

.0446 .0579 

.0448 .0579 

3 

.0239 .0273 

.0232 .0273 

.0463 .0689 

.0465 .0689 

4 

.0333 .0389 

.0345 .0389 

.0515 .0103 

.0536 .0103 

10 2 

.0116 .0120 

.0116 .0120 

.0294 .0326 

.0294 .0326 

4 

.0136 .0149 

.0136 .0149 

.0282 .0380 

.0283 .0380 

6 

.0163 .0187 

.0160 .0187 

.0316 .0501 

.0319 .0501 

8 

.0229 .0276 

.0227 .0276 

.0348 .0689 

•0355 .0689 

15 3 

.0092 .0098 

.0091 .0098 

.0219 .0264 

.0219 .0264 

6 

.0103 .0119 

.0102 .0119 

.0225 .0324 

.0225 .0324 

9 

.0129 .0156 

.0125 .0156 

.0231 .0403 

.0231 .0403 

12 

.0173 .0216 

.on .0216 

.0263 .0590 

.0265 .0590 


5 

1 

.0169 .0176 

.0168 .0176 

.0408 .0466 

.0408 .0466 


2 

.0190 .0203 

.0189 .0203 

.0414 .0540 

. .0415 .0540 


3 

.0233 .0265 

.0227 .0265 

.0441 .0679 

.0445 .0679 


4 

.0327 .0385 

.0356 .0385 

.0488 .0996 

.0521 .0996 

10 

2 

.0099 .0105 

.0100 .0105 

.0239 .0281 

.0239 .0281 


4 

.0123 .0137 

.0123 .0137 

.0238 .0339 

.0238 .0339 


6 

.0149 .0176 

.0147 .0176 

.0254 .0455 

\.0253 .0455 


8 

.0217 .0260 

.0220 .0260 

.0305 .0646 

.0306 .0646 
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APPENDIX A 

HONTE CARLO STUDY USING THE 
1977 SPRING WHEAT DATA SET 
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summary: when the simulated stratum COMTAIMS 20 SEGHENTSf 

THE SE6MEHT SiHPLE SIZE IS 5 AMD THE NUMBER OF PARTIALLY IDENTIFIED SEKSHTS IS 
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summary: when the simulated stratum contains 20 SEGMENTS f 

THE SEGMENT SAMPLE SI?*^ IS 5 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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SUMHARY: UHEN the simulated stratum contains 20 SEOMENTSr 

THE SEGMENT SAMPLE SIZE IS 5 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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summary: UHEN the simulated stratum contains 20 SEONENTSr 

THE segment sample SIZE IS 5 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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summary: when the simulated stratum contains 20 SEGMENTS* 

THE SEGMENT SAMPLE SIZE IS 10 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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THE TRUE SUPER POPULATION PROPORTIONS HERE : 0.05279 0.25134 0.69587 


summary: when the simulated 5TRATUH contains 40 SEGMENTSf 

THE SEGMENT SAMPLE SIZE IS 5 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGfEMTS IS 
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summary: when the simulated stratum contains 40 SEGHENTSf 

THE SEGMENT SAMPLE SIZE IS lb AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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summary: UHEN the simulated .stratum contains 40 SEGMENTS* 

THE SEGMENT SANfLE SIZE IS 10 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGiSHTS IS 
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summary: UHEM the SIMUItATED .stratum contains 40 SEGMENTSf 

THE SEGMENT SAMPLE SIZE IS 10 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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summary: UHEN T»€ simulated stratum contains 40 SESiSNTS* 

THE SEGMENT SAMPLE SIZE IS IS AND THE NUMDER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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SUHMARY; when the simulated stratum contains 20 SEGMENTS » 

THE SEGMENT SAMPLE SIZE IS 5 AND THE NUMBER OF PARTIALLY IDENTIFIER SEGMENTS IS 
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SUMHARY: when the SIHULATED ^stratum contains 20 SE6HENTS» 

THE SEGMENT SAMPLE SIZE IS 5 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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summary: when the SIMUUATED .stratum CONTAIHS 20 SEGMENTS^ 

THE SEGMENT SAMPLE SIZE IS 10 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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SUHMARY; UHEM the SIMUItATED ^stratum contains 20 SEGMENTS» 

THE SEGMENT SAMPLE SIZE IS 10 AND THE NUMBER OF PARTIALLY IDENTIFIED SEGMENTS IS 
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summary; when the simulated stratum contains 20 SEGMEMTSf 

THE SEGMENT SAMPLE SIZE IS 3?0 AND THE NUMBER OF PARTIALLY IDENTIFIED SEKCENTS IS 
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